AITopics | unlabelled data

unlabelled data

Prediction-powered Inference by Mixture of Experts

Gu, Yanwu, Kong, Linglong, Xia, Dong

arXiv.org Machine Learning, May-1-2026

The rapidly expanding artificial intelligence (AI) industry has produced diverse yet powerful prediction tools, each with its own network architecture, training strategy, data-processing pipeline, and domain-specific strengths. These tools create new opportunities for semi-supervised inference, in which labeled data are limited and expensive to obtain, whereas unlabeled data are abundant and widely available. Given a collection of predictors, we treat them as a mixture of experts (MOE) and introduce an MOE-powered semi-supervised inference framework built upon prediction-powered inference (PPI). Motivated by the variance reduction principle underlying PPI, the proposed framework seeks the mixture of experts that achieves the smallest possible variance. Compared with standard PPI, the MOE-powered inference framework adapts to the unknown performance of individual predictors, benefits from their collective predictive power, and enjoys a best-expert guarantee. The framework is flexible and applies to mean estimation, linear regression, quantile estimation, and general M-estimation. We develop non-asymptotic theory for the MOE-powered inference framework and establish upper bounds on the coverage error of the resulting confidence intervals. Numerical experiments demonstrate the practical effectiveness of MOE-powered inference and corroborate our theoretical findings.
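The variance-reduction idea in the abstract can be illustrated for the simplest case, mean estimation. The sketch below is not the authors' algorithm: it assumes an unconstrained linear mix of the experts, plug-in covariance estimates, and a normal-approximation standard error. The function name `moe_ppi_mean` and its arguments are hypothetical.

```python
import numpy as np

def moe_ppi_mean(y_lab, preds_lab, preds_unlab):
    """PPI-style mean estimate with a variance-minimizing linear mix of experts.

    y_lab:       (n,)   labels on the labeled sample
    preds_lab:   (n, K) predictions of K experts on the labeled sample
    preds_unlab: (N, K) predictions of the same experts on the unlabeled sample
    Returns (estimate, weights, standard_error).
    """
    y_lab = np.asarray(y_lab, dtype=float)
    preds_lab = np.asarray(preds_lab, dtype=float)
    preds_unlab = np.asarray(preds_unlab, dtype=float)
    n, N = len(y_lab), len(preds_unlab)

    # Plug-in covariance of the experts (pooled sample) and of experts vs. labels.
    pooled = np.vstack([preds_lab, preds_unlab])
    cov_f = np.atleast_2d(np.cov(pooled, rowvar=False))
    centred = preds_lab - preds_lab.mean(axis=0)
    cov_fy = centred.T @ (y_lab - y_lab.mean()) / (n - 1)

    # The estimator's variance as a function of the weights w is
    #   w' cov_f w / N + (var_y - 2 w' cov_fy + w' cov_f w) / n,
    # and setting its gradient to zero gives the weights below.
    # Assumption: w is treated as fixed, ignoring that it is estimated from the data.
    w = np.linalg.pinv(cov_f) @ cov_fy * (N / (N + n))

    # Prediction on unlabeled data plus a bias correction ("rectifier") on labeled data.
    mixed_unlab = preds_unlab @ w
    resid = y_lab - preds_lab @ w
    estimate = mixed_unlab.mean() + resid.mean()
    std_err = np.sqrt(mixed_unlab.var(ddof=1) / N + resid.var(ddof=1) / n)
    return estimate, w, std_err
```

Under these assumptions, an approximate 95% interval would be `estimate ± 1.96 * std_err`. With a single expert the weight reduces to a scalar tuning parameter, so an uninformative predictor is automatically down-weighted toward zero.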

artificial intelligence, estimator, machine learning, (18 more...)

arXiv.org Machine Learning

2604.27892

Country:

North America (0.45)
Asia (0.28)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.92)

0918183ced31affb7ce0345e45ac1943-Paper-Conference.pdf

Neural Information Processing Systems, Apr-24-2026, 11:08:27 GMT

artificial intelligence, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Country: Asia (0.28)

Genre: Research Report (0.94)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation - Supplementary Material - Bingchen Zhao, Kai Han

Neural Information Processing Systems, Feb-11-2026, 00:37:31 GMT

It can be seen that, except in the extreme case of very small k (e.g. k = 1), the results are generally stable, further corroborating the robustness of ranking statistics. We also carry out experiments using "hard" and "soft" cosine similarity. For the "hard" cosine similarity, we simply apply a threshold (0.9 in our experiments) to the score to get binary pseudo labels. For the "soft" cosine similarity, we directly take the score as the soft pseudo label. We choose to use soft ranking statistics because we believe the continuous similarity better reflects the actual similarity of objects than the binary score. This is important for pairs with a similarity score around 0.5, for which the binary score is not very reliable.
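The difference between the two pseudo-label variants can be sketched in a few lines. This is an illustration only: the helper names are hypothetical, plain cosine similarity on raw feature vectors is assumed, and whether the comparison with the 0.9 threshold is strict is also an assumption.

```python
import math

def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm

def pairwise_pseudo_labels(features, threshold=0.9):
    """Return (hard, soft) pairwise pseudo labels for a list of feature vectors.

    soft[i][j] is the cosine similarity itself;
    hard[i][j] is 1 when that similarity exceeds the threshold, else 0.
    """
    m = len(features)
    soft = [[cosine(features[i], features[j]) for j in range(m)] for i in range(m)]
    hard = [[1 if s > threshold else 0 for s in row] for row in soft]
    return hard, soft
```

A pair scoring 0.55 and a pair scoring 0.85 both receive a hard label of 0, while their soft labels keep the distinction, which is the case the passage singles out.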

artificial intelligence, dataset, statistics, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.91)

Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose

Neural Information Processing Systems, Feb-8-2026, 06:54:52 GMT

Our model is trained in an EM-type manner, alternating between increasing the 3D pose invariance of the feature extractor and annotating unlabelled data through neural view synthesis and matching.

artificial intelligence, inductive learning, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States > Maryland (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.30)

0918183ced31affb7ce0345e45ac1943-Paper-Conference.pdf

Neural Information Processing Systems, Feb-7-2026, 08:56:25 GMT

dataset, international conference, learning, (13 more...)

Neural Information Processing Systems

Country:

Africa (0.04)
Europe > France (0.04)
Asia > Nepal (0.04)
Asia > Indonesia (0.04)

Genre: Research Report (0.94)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose

Neural Information Processing Systems, Dec-24-2025, 00:13:58 GMT

We study the problem of learning to estimate the 3D object pose from a few labelled examples and a collection of unlabelled data. Our main contribution is a learning framework, neural view synthesis and matching, that can transfer the 3D pose annotation from the labelled to unlabelled images reliably, despite unseen 3D views and nuisance variations such as the object shape, texture, illumination or scene context. In our approach, objects are represented as 3D cuboid meshes composed of feature vectors at each mesh vertex. The model is initialized from a few labelled images and is subsequently used to synthesize feature representations of unseen 3D views. The synthesized views are matched with the feature representations of unlabelled images to generate pseudo-labels of the 3D pose.
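The final matching step can be pictured with a heavily simplified sketch. It assumes each synthesized view and each unlabelled image has already been reduced to a single feature vector, and that matching means picking the candidate pose with the highest cosine similarity. The paper works with mesh-based feature representations, so the function `pseudo_label_pose` and the pose keys below are hypothetical stand-ins rather than the authors' procedure.

```python
import math

def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm

def pseudo_label_pose(image_feature, synthesized_views):
    """Pick the candidate pose whose synthesized feature best matches the image.

    synthesized_views: dict mapping a candidate pose (here an azimuth in
    degrees, purely for illustration) to its synthesized feature vector.
    Returns (best_pose, similarity).
    """
    best_pose = max(
        synthesized_views,
        key=lambda pose: cosine(image_feature, synthesized_views[pose]),
    )
    return best_pose, cosine(image_feature, synthesized_views[best_pose])
```

The returned similarity could serve as a confidence score for deciding which pseudo-labels to keep, although that filtering rule is also an assumption here.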

name change, neural view synthesis and matching, semi-supervised few-shot learning, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Wasserstein distance based semi-supervised manifold learning and application to GNSS multi-path detection

Blais, Antoine, Couëllan, Nicolas

arXiv.org Machine Learning, Dec-8-2025

The main objective of this study is to propose an optimal transport based semi-supervised approach to learn from scarce labelled image data using deep convolutional networks. The principle lies in implicit graph-based transductive semi-supervised learning where the similarity metric between image samples is the Wasserstein distance. This metric is used in the label propagation mechanism during learning. We apply and demonstrate the effectiveness of the method on a GNSS real life application. More specifically, we address the problem of multi-path interference detection. Experiments are conducted under various signal conditions. The results show that for specific choices of hyperparameters controlling the amount of semi-supervision and the level of sensitivity to the metric, the classification accuracy can be significantly improved over the fully supervised training method.
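Graph-based label propagation with a Wasserstein similarity can be sketched as follows. This is an explicit, small-scale illustration, not the implicit deep-network scheme the abstract describes: each sample is assumed to be a 1-D array of equal length (for example, flattened pixel intensities), the closed-form label-spreading update is a swapped-in standard technique, and `sigma` and `alpha` are hypothetical stand-ins for the sensitivity and semi-supervision hyperparameters.

```python
import numpy as np

def wasserstein_1d(a, b):
    """W1 distance between two equal-size 1-D empirical samples."""
    return float(np.mean(np.abs(np.sort(a) - np.sort(b))))

def propagate_labels(samples, labels, n_classes, sigma=1.0, alpha=0.9):
    """Spread known labels over a graph whose edge weights decay with W1 distance.

    samples: list of 1-D arrays of equal length
    labels:  int array, with -1 marking unlabelled samples
    Returns a predicted class index for every sample.
    """
    m = len(samples)
    dist = np.array([[wasserstein_1d(samples[i], samples[j]) for j in range(m)]
                     for i in range(m)])
    # Gaussian affinity on the Wasserstein distance, without self-loops.
    affinity = np.exp(-(dist / sigma) ** 2)
    np.fill_diagonal(affinity, 0.0)
    # Symmetric normalization D^{-1/2} A D^{-1/2}.
    d_inv_sqrt = 1.0 / np.sqrt(np.maximum(affinity.sum(axis=1), 1e-12))
    s = affinity * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    # One-hot matrix of the known labels; unlabelled rows stay zero.
    y0 = np.zeros((m, n_classes))
    known = labels >= 0
    y0[known, labels[known]] = 1.0
    # Closed-form fixed point of F = alpha * S @ F + (1 - alpha) * Y0, up to scale.
    scores = np.linalg.solve(np.eye(m) - alpha * s, y0)
    return scores.argmax(axis=1)
```

A smaller `sigma` makes the graph more sensitive to the metric, and a larger `alpha` lets neighbouring samples influence each other more strongly relative to the initial labels.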

application, sup, wasserstein distance, (15 more...)

arXiv.org Machine Learning

2512.05567

Country:

Europe > France > Occitanie > Haute-Garonne > Toulouse (0.05)
North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
(2 more...)

Genre: Research Report > New Finding (0.49)

Industry: Education (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Mitigating the Antigenic Data Bottleneck: Semi-supervised Learning with Protein Language Models for Influenza A Surveillance

Xu, Yanhua

arXiv.org Artificial Intelligence, Dec-8-2025

Influenza A viruses (IAVs) evolve antigenically at a pace that requires frequent vaccine updates, yet the haemagglutination inhibition (HI) assays used to quantify antigenicity are labor-intensive and unscalable. As a result, genomic data vastly outpace available phenotypic labels, limiting the effectiveness of traditional supervised models. We hypothesize that combining pre-trained Protein Language Models (PLMs) with Semi-Supervised Learning (SSL) can retain high predictive accuracy even when labeled data are scarce. We evaluated two SSL strategies, Self-training and Label Spreading, against fully supervised baselines using four PLM-derived embeddings (ESM-2, ProtVec, ProtT5, ProtBert) applied to haemagglutinin (HA) sequences. A nested cross-validation framework simulated low-label regimes (25%, 50%, 75%, and 100% label availability) across four IAV subtypes (H1N1, H3N2, H5N1, H9N2). SSL consistently improved performance under label scarcity. Self-training with ProtVec produced the largest relative gains, showing that SSL can compensate for lower-resolution representations. ESM-2 remained highly robust, achieving F1 scores above 0.82 with only 25% labeled data, indicating that its embeddings capture key antigenic determinants. While H1N1 and H9N2 were predicted with high accuracy, the hypervariable H3N2 subtype remained challenging, although SSL mitigated the performance decline. These findings demonstrate that integrating PLMs with SSL can address the antigenicity labeling bottleneck and enable more effective use of unlabeled surveillance sequences, supporting rapid variant prioritization and timely vaccine strain selection.
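Self-training, one of the two SSL strategies named above, can be sketched generically. The example below is not the study's pipeline: it assumes fixed embedding vectors as input (standing in for PLM-derived embeddings), uses a nearest-centroid classifier with a softmax over negative squared distances as a hypothetical base model, and the 0.9 confidence threshold is an arbitrary choice.

```python
import numpy as np

def self_train(x, y, n_classes, threshold=0.9, max_rounds=10):
    """Iteratively pseudo-label unlabelled rows that the classifier is confident about.

    x: (m, d) embedding matrix
    y: (m,) int labels, with -1 marking unlabelled rows; every class needs
       at least one labelled row to start from
    Returns a label array in which rows never confidently labelled stay -1.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y).copy()
    for _ in range(max_rounds):
        idx = np.flatnonzero(y < 0)
        if idx.size == 0:
            break
        # Refit the base model: one centroid per class from currently labelled rows.
        centroids = np.stack([x[y == c].mean(axis=0) for c in range(n_classes)])
        # Softmax over negative squared distances as a confidence score.
        d2 = ((x[idx, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        logits = -d2
        logits -= logits.max(axis=1, keepdims=True)
        prob = np.exp(logits)
        prob /= prob.sum(axis=1, keepdims=True)
        accept = prob.max(axis=1) >= threshold
        if not accept.any():
            break
        # Promote confident predictions to labels for the next round.
        y[idx[accept]] = prob[accept].argmax(axis=1)
    return y
```

The input array is left untouched, so the same embeddings can be reused to compare the self-trained labels against a fully supervised baseline.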

artificial intelligence, machine learning, sequence, (16 more...)

arXiv.org Artificial Intelligence

2512.05222

Genre: Research Report > New Finding (1.00)

Industry: